Linguist's Assistant: A Multi-Lingual Natural Language Generator based on Linguistic Universals, Typologies, and Primitives

نویسندگان

  • Tod Allman
  • Stephen Beale
  • Richard Denton
چکیده

Linguist’s Assistant (LA) is a large scale semantic analyzer and multi-lingual natural language generator designed and developed entirely from a linguist’s perspective. The system incorporates extensive typological, semantic, syntactic, and discourse research into its semantic representational system and its transfer and synthesizing grammars. LA has been tested with English, Korean, Kewa (Papua New Guinea), Jula (Cote d’Ivoure), and North Tanna (Vanuatu), and proof-of-concept lexicons and grammars have been developed for Spanish, Urdu, Tagalog, Chinantec (Mexico), and Angas (Nigeria). This paper will summarize the major components of the NLG system, and then present the results of experiments that were performed to determine the quality of the generated texts. The experiments indicate that when experienced mothertongue translators use the drafts generated by LA, their productivity is typically quadrupled without any loss of quality.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Linguist's Assistant: A Resource For Linguists

The Linguist’s Assistant (LA) is a practical computational paradigm for describing languages. In this paper we describe how to use LA with naturally occurring texts that exemplify interesting target-language linguistic phenomena. We will describe how such texts can be semantically analyzed using a convenient semi-automatic document authoring interface, in effect adding them to LA’s standard sem...

متن کامل

The Re-use of Linguistic Resources across Languages in Multilingual Generation Components

An approach to generation system design is described which supports maximal expression of commonalities across languages. Within this approach it becomes natural to represent inherently multilingual grammars and semantics. The approach rests on the linguistic notion of functional similarity and difference: by capturing the functions languages need to perform , we achieve a level of linguistic d...

متن کامل

Using Linguist's Assistant for Language Description and Translation

The Linguist’s Assistant (LA) is a practical computational paradigm for describing languages. LA seeks to specify in semantic representations a large subset of possible written communication. These semantic representations then become the starting point and organizing principle from which a linguist describes the linguistic surface forms of a language using LA's visual lexicon and grammatical r...

متن کامل

Spatial Language and Geographic Information Systems: Cross-linguistic Issues (el Lenguaje Espacial Y Los Sistemas De Informacion Geograficos: Temas

The great majority of existing geographic information systems have been designed by English or German speakers. Since human natural languages impose structure on the cognition and perception of space, time, and other concepts, GIS data models, and especially GIS query languages and human interfaces, can be expected to contain artifacts of the language spoken by their designers, most commonly En...

متن کامل

Predicting Linguistic Structure with Incomplete and Cross-Lingual Supervision

Täckström, O. 2013. Predicting Linguistic Structure with Incomplete and Cross-Lingual Supervision. Acta Universitatis Upsaliensis. Studia Linguistica Upsaliensia 14. xii+215 pp. Uppsala. ISBN 978-91-554-8631-0. Contemporary approaches to natural language processing are predominantly based on statistical machine learning from large amounts of text, which has been manually annotated with the ling...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012